Over-optimistic evaluation and reporting of novel cluster algorithms: an illustrative study

Abstract

When researchers publish new cluster algorithms, they usually demonstrate the strengths of their novel approaches by comparing the algorithms’ performance with existing competitors. However, such studies are likely to be optimistically biased towards the new algorithms, as the authors have a vested interest in presenting their method as favorably as possible in order to increase their chances of getting published. Therefore, the superior performance of newly introduced cluster algorithms is over-optimistic and might not be confirmed in independent benchmark studies performed by neutral and unbiased authors. This problem is known among many researchers, but so far, the different mechanisms leading to over-optimism in cluster algorithm evaluation have never been systematically studied and discussed. Researchers are thus often not aware of the full extent of the problem. We present an illustrative study to illuminate the mechanisms by which authors, consciously or unconsciously, paint their cluster algorithm’s performance in an over-optimistic light. Using the recently published cluster algorithm Rock as an example, we demonstrate how optimization of the used datasets or data characteristics, of the algorithm’s parameters and of the choice of the competing cluster algorithms leads to Rock’s performance appearing better than it actually is. Our study is thus a cautionary tale that illustrates how easy it can be for researchers to claim apparent “superiority” of a new cluster algorithm. This illuminates the vital importance of strategies for avoiding the problems of over-optimism (such as, e.g., neutral benchmark studies), which we also discuss in the article.
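The mechanisms named in the abstract (optimizing the algorithm's parameters and the choice of datasets) can be illustrated with a small simulation. The sketch below is not taken from the article and does not use Rock or any real data: it assumes two methods with identical true quality whose measured scores differ only by random noise, and the names `noisy_score` and `simulate` as well as all numeric settings are hypothetical. It shows that selecting the best parameter setting and then reporting only the most favorable datasets may produce an apparent advantage where none exists.

```python
import random
import statistics


def noisy_score(rng, true_quality=0.70, noise=0.05):
    """Return one simulated clustering score: true quality plus Gaussian noise."""
    return rng.gauss(true_quality, noise)


def simulate(n_datasets=20, n_settings=10, n_reported=5, seed=0):
    """Compare a 'new' method with a competitor of identical true quality.

    All values are simulated assumptions, not results from the article.
    """
    rng = random.Random(seed)
    neutral_gaps, tuned_gaps = [], []
    for _ in range(n_datasets):
        competitor = noisy_score(rng)  # competitor run once with defaults
        new_scores = [noisy_score(rng) for _ in range(n_settings)]
        # Neutral comparison: the new method also uses one fixed setting.
        neutral_gaps.append(new_scores[0] - competitor)
        # Parameter optimization: keep the best setting, chosen after
        # looking at the scores.
        tuned_gaps.append(max(new_scores) - competitor)
    # Dataset optimization: report only the datasets with the largest gaps.
    reported = sorted(tuned_gaps, reverse=True)[:n_reported]
    return (
        statistics.mean(neutral_gaps),
        statistics.mean(tuned_gaps),
        statistics.mean(reported),
    )


if __name__ == "__main__":
    neutral, tuned, selected = simulate()
    print(f"mean gap, neutral comparison:      {neutral:+.3f}")
    print(f"mean gap, tuned parameters:        {tuned:+.3f}")
    print(f"mean gap, tuned + chosen datasets: {selected:+.3f}")
```

Because the tuned score on each dataset is at least the default score, and the reported subset contains the largest gaps, the three averages can only grow from left to right, even though both methods are equally good by construction.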




Journal

Journal title: Advances in data analysis and classification

Year: 2022

ISSN: 1862-5355, 1862-5347

DOI: https://doi.org/10.1007/s11634-022-00496-5